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C^ , Consider a Markov chain on an infinite tree T = {V, E) rooted 

fT^ ■ at p. In such a chain, once the initial root state (t{p) is chosen, each 

vertex iteratively chooses its state from the one of its parent by an 
application of a Markov transition rule (and all such applications are 
independent). Let ^j denote the resulting measure for o{p) =j. The 
r^ ^ resulting measure pj is defined on configurations a — {a{x))x£V £ 

(~| ■ A^ , where A is some finite set. Let p^ denote the restriction of p 

to the sigma-algebra generated by the variables cr{x), where x is at 
distance exactly n from p. Letting an =ma,Xij£AdTvip'i',l^"), where 
rixv denotes total variation distance, we say that the reconstruction 
problem is solvable if liminfn^no Qn > 0. Reconstruction solvability 
^S] ' roughly means that the nth level of the tree contains a nonvanishing 

^ ' amount of information on the root of the tree as n ^ oo. 

C^^ . In this paper we study the problem of robust reconstruction. Let 

xl ' V he a nondegenerate distribution on A and e > 0. Let a be chosen 

^^T I according to p" and a' be obtained from a by letting for each node in- 

dependently, cr(«) — cr'{v) with probability 1 — e and cr'(w) be an inde- 
pendent sample from i/ otherwise. We denote by p"[u,e] the resulting 
^— V , measure on a' . The measure p" [v, e] is a perturbation of the measure 

p" . Letting an{u,e) — maxijg^dTv(A'"[^'^liM7['^'^])' ^^ ^^y that the 
reconstruction problem is v -robust-solvable if liminf„^oo anii^,B) > 
C^ ■ for all < £ < 1. Roughly speaking, the reconstruction problem is 

C ' robust-solvable if for any noise-rate and for all n, the nth level of the 

• • , tree contains a nonvanishing amount of information on the root of 

^, ' the tree. 

Standard techniques imply that if T is the rooted _B-ary tree 

(where each node has B children) and ii B\\2(M)\^ > 1, where A2(M) 

C^ ' is the second largest eigenvalue of M (in absolute value), then for all 

nondegenerate u, the reconstruction problem is //-robust-solvable. We 
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2 S. JANSON AND E. MOSSEL 

prove a converse and show that the reconstruction problem is not v- 
robust-solvable if _B|A2(M)| < 1. This proves a conjecture by the 
second author and Y. Peres. We also consider other models of noise 
and general trees. 

1. Introduction. In this paper we study the perturbative theory of re- 
construction on trees, and show how it depends on the spectrum of the un- 
derlying Markov chain. In particular, we show that the threshold for "robust 
reconstruction" for the S-ary tree is i?|A2(M)p = 1, where A2(M) denotes 
the eigenvalue of M which is the second largest in absolute value. In Section 

3 we prove a similar threshold for general bounded degree trees, where B is 
replaced by the branching number of the tree br(r). We refer the reader to 
Section 1.2 and to [3, 7, 21, 22] for background. 

1.1. Definitions and main results. We proceed with some formal defini- 
tions. Let T = (y, E, p) be a tree T with nodes y , edges E and root p ^V . 
We direct all edges away from the root so that if e = (x, y), then x is on the 
path connecting p to y. Let <i(-,-) denote the graph-metric distance on T, 
and Ln = {v £ V : d{p, v) = n} be the nth level of the tree. For x €V and 
e = {y,z) £ E, we denote |3;| = d{p,x), d{x, {y,z)) = inax{d{x,y),d{x,z)} and 
|e| = d{p,e). The B-ary tree is the infinite rooted tree, where each vertex 
has exactly B children. 

A Markov chain on the tree is a probability measure whose state space 
is A^ , where ^ is a finite set. Without loss of generality we assume that 
A = {1, . . . , q). Assume first that T is finite and let M = {Mij)ij^j\^ be a 
stochastic matrix. In this case the probability measure defined by M on T 
is given by 

(1) A^(0") =l{<7(p)=<?} n ^(T{x),a{y)- 

{x,y)€E 

In other words, in fli the root state cr{p) satisfies cr{p) = i and then each 
vertex iteratively chooses its state from the one of its parent by an appli- 
cation of the Markov transition rule given by M (and all such applications 
are independent). We can define the measure jli on an infinite tree too, by 
Kolmogorov's extension theorem, but we will not need chains on infinite 
trees in this paper (see [7] for basic properties of Markov chains on trees). 

Instead, for an infinite tree T, we let r„ = {Vn,En,p), where Vn = {x G 
V : d{x, p) <n},En = {e £ E: d{e, p) < n} and define /2" by (1) for T„. More 
explicitly, 

(2) p'l{(T) = l{a{p)=£} n ^(Tix),aiy)- 

{x,y)eEn 



RECONSTRUCTION ON TREES 3 

We are particularly interested in the distribution of the states cr{x) for x £ 
Ln, the set of leaves in r„. This distribution, denoted by /i^!, is the projection 
of /2" on A^" given by 

(3) ^^lia)=J2{jlU^):a\L^ = a}. 

In this paper we are interested in perturbative theory of the above pro- 
cess. Below we give three definitions of perturbations of ^,„ representing 
three different types of "noise." We call a distribution z> on A = {1, . . . ,q} 
nondegenerate, if i^(i) > 0, for all 1 <i <q. 

In the general setting the perturbation is obtained by observing, for leaves 
X £ Ln, not the state <t(x) but a state (in a state space B possibly different 
from A) derived from cr{x) by another random choice (independently for 
all leaves). The extra choice can be described by a stochastic matrix N = 
{Nij)i^_Aj^B', this defines a probability measure on A^" x B^" by 

(4) /i[iV]^(a,r) = l|„(p)=£} Y[ M^(^),^(j^) X [] iV„(j^),^(j^), 

and the distribution of our observed states is the projection /^[A^]" on B^" 
given by 

(5) n[N]^{T) = J2f,[N]na,T). 

We will mostly be interested in the following types of noise: 

• Given A; > 0, define N = M . Here, for each leaf independently, k addi- 
tional steps of the chain are performed. We write ^"[A;] for yu"[A^]. 

• Given a distribution v on A, define Ni^ = (1 — £)l{i=j} + ^I'j- Here, for 
each leaf independently, with probability 1 — e, there is no noise; otherwise, 
the leaf state is chosen independently from anything else according to v. 
We win write /^^[z^, e] for /^[A^]". 

• Given < e < 1, we let A^ be a g' x ((7 + 1) matrix defined by Ni^i = (1 — e), 
^i,q+i = e and iVjj = otherwise. Here, for each leaf independently, the 
state at the leaf is deleted with probability e (deletion is marked by q + 1). 
We write /i^[e] for i4[N]. 

Recall that for distributions /i and v on the same space Q, the total 
variation distance between ^ and u is 

(6) Z)y(/i,z.) = i5]|/x(a)-K^)|. 

Definition 1.1. (i) The reconstruction problem for the 5-ary tree T 
and M is solvable if there exist i,j G A, for which 

(7) liminfDy(/i^/i")>0. 
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(ii) The reconstruction problem for the B-aiy tree T and M is robust- 
solvable if for all A; < oo there exist i,j £ A for which 

(8) lim^inf Dv d-t^ [k] , ^i] [k]) > 0. 

(iii) Let ly he a nondegenerate distribution. The reconstruction problem 
for the B-ary tree T and M is i^ -robust- solvable if for all e < 1, there exist 
i,j £ A, for which 

(9) lim inf Dv (/x^ [u, e] , /x" [i/, e] ) > 0. 

(iv) The reconstruction problem for the B-aiy tree T and M is erasure- 
robust- solvable if for all e < 1, there exist i, j G A, for which 

(10) liminf L'K(/ir[e],/^?[e]) > 0. 

n — ^oo -' 

Note that by taking e = in (9) or (10) we obtain the original reconstruction 
condition (7). The same is true if A; = in (8). 

Let A2(M) denote the eigenvalue of M which has the second largest abso- 
lute value [A2(M) may be negative or nonreal]. In our main result we prove 
the following: 

Theorem 1.2. Consider an ergodic Markov chain on the B-ary tree 
such that B\X2{M)\'^ < 1. Then we have the following: 

(i) The reconstruction problem is not robust- solvable. Moreover, there 
exists k* such that for all k> k* , 

(11) max lim Dy(//"[^],/"?W) = 0. 

(ii) For all nondegenerate v, the reconstruction problem is not v-robust- 
solvable. Moreover, for all nondegenerate v , there exists e* < 1 such that for 

all e> e* , 

(12) max lim Dv{^Ji'^[v,e\,^Ji''[v,e\) = {). 

i,j n— >oo -I 

(iii) If all the entries of M are nonzero, then the reconstruction problem 
is not erasure-robust- solvable. Moreover, there exists an e* <\ such that for 
all e > e* , 

(13) max lim Dy(/i[^[e],/x"[e])=0. 

ij n^oo ' J ^ " 

It is easy to see that the total variation distances in (11), (12) and (13) 
are monotone decreasing in k,e and e respectively. 

The following proposition follows immediately from [12] or from the proofs 
in [22]. Together with Theorem 1.2, it shows that the threshold for robust 
reconstruction is given by B\\2{M)\'^ = 1. 
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Proposition 1.3. Consider an ergodic Markov chain on the B-ary tree 
where B\\2{M)\^ > 1. Then we have the following: 

(i) The reconstruction problem is robust- solvable. 

(ii) For all nondegenerate v the reconstruction problem is v' -robust-solvable. 
(iii) The reconstruction problem is erasure-robust-solvable. 

1.2. Discussion. The reconstruction problem was first studied in statis- 
tical physics [9, 28], where the problem was phrased in terms of extremality 
of the free measure for the Ising model on the (B + l)-regular tree (Bethe 
lattice). It is not too hard to see (e.g., [3]) that the measure is nonextremal 
if and only if the reconstruction problem is solvable for the Markov chain 
on the B-ary tree with transition probabilities given by the binary symmet- 
ric Markov chain: ( ^ -^ _ . j . (5 is related to the "inverse temperature" P 
by 1 — 26 = tanh(2/3). The equivalence between nonextremality of the free 
measure of a random field and reconstruction solvability of an associated 
Markov chain on the same tree holds under mild nondegeneracy conditions 
(see, e.g., [18]). 

In the past decade, the reconstruction problem reappeared in many ap- 
plications: In communication networks (see [3] and the references there), in 
noisy computation (a model introduced by von Neumann in [29], see [8, 4]) 
and in phylogeny (molecular evolution, see [5, 26] for general background) 
[27]. Most recently, it is shown that the reconstruction problem is of cru- 
cial importance to basic questions in phylogeny [19, 20, 23]. In all of these 
applications the interest is to find when is it possible to reconstruct some 
information on the root state from states at the leaves of a finite tree. In 
many of the applications it is natural to consider robust-reconstruction as 
the observed data goes via additional "noise mechanism." 

Solvability of the reconstruction problem is also closely related to the 
mixing rate of Glauber dynamics on the tree. See [1, 17], where it is shown 
that nonsolvability roughly corresponds to rapid mixing dynamics on the 
tree. 

Determining if the reconstruction problem is solvable or not turns out 
to be very hard. Binary symmetric Markov chains is the only family for 
which the threshold for reconstruction solvability is known. Even here there 
is a generation gap between the proof of the lower-bound [9] and proofs of 
the upper bound [2] (see also [10] for a different proof, [3] for the result on 
general trees and [24] for the critical case on general trees). For binary sym- 
metric Markov chains on the B-a,iy tree the threshold for the reconstruction 
problem is given by B{1 — 26)^ = 1, or, equivalently, BX2{M)'^ = 1. For all 
other families of Markov chain, including g-ary symmetric Markov chain for 
q> 2 and general 2x2 Markov chains, only bounds are known [15, 17, 22]. 
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The threshold B\X2{M)\'^ = 1 is also the threshold for "census-solvability" 
[22], where different nodes of L„ are indistinguishable (in other words, we 
only observe the "census" of level n). However, in general, it is not the 
threshold for reconstruction. Indeed, except for the binary symmetric chan- 
nel, we know of no family of chains for which B\\2{M)\'^ = 1 is the threshold 
for reconstruction. Moreover, [18] shows that for asymmetric binary Markov 
chains (general stochastic 2 by 2 matrices) or symmetric Markov chains for 
q>2 [where Mjj = (1 - '5)l{i=j} + ^Iji^j} for i, j G {1, . . . , q}], the recon- 
struction problem is sometimes solvable even when B\X2{M)\'^ < 1. In [18] 
there is also a construction of M with A2 (M) = for which the reconstruc- 
tion problem is solvable for large B. 

Why is determining the threshold for reconstruction hard? From the tech- 
nical point of view a Markov chain on the tree corresponds to a recursion in 
some random variables ([2, 17, 24]). A natural way to analyze these recur- 
sions is to use a perturbative argument around the stationary distribution 
of the chain. The main problem is that the random variables we start with 
are atoms — far from the stationary distribution — and that, in general, the 
recursions lack any convexity. For "robust-reconstruction" the problem is 
easier — as the recursions begin close to the stationary distribution. 

Our proof is based on a new measure of discrepancy for a vector of dis- 
tributions which is a weighted variant of the x^ distance. We show that an 
application of the chain M contracts the discrepancy by a |A2(M)p factor, 
and that if the discrepancy is smaller than 6, then tensoring B copies of the 
distributions increases the discrepancy by a factor of at most B(l + e{6)), 
where e{S) — > 0, as 5 — > 0. 

It is interesting to compare our results with the results of [25]. In [25] 
Pemantle and Steif study robust phase transition on trees. For a Gibbs 
measure on a tree we say that a robust phase transition occurs if the bound- 
ary conditions on a cutset have a nonvanishing effect on the root even when 
the interactions along the cutsets are made arbitrarily small but fixed (see 
[25] for exact definition). The main results of [25] give the exact threshold 
for robust phase transitions for general (bounded degree) trees for Potts 
and Heisenberg models in terms of the underlying model and the branching 
number (see [14]) of the tree. 

Both in our result and in the results of [25], it is easier to analyze the 
"robust" problem than it is the original problem for similar reasons. In both 
cases the "nonrobust" problem is hard to control without some convexity 
assumption, while the solution of "robust" problem allows the use of "local" 
arguments. 

Moreover, like robust phase transition, robust reconstruction is a geo- 
metric property, that is, for general bounded degree T, the threshold for 
robust-reconstruction depends only on br(T) and |A2(M)|. Indeed, the proof 
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of Theorem 3.3 combines the analysis of the new discrepancy measure in- 
troduced here, with some of the techniques developed in [25] for controlling 
recursions on general trees. 

A natural open problem is to determine the behavior of robust-solvability 
in the critical case, where B\X2{M)\'^ = 1. Our techniques shed no light on 
this problem. It is also interesting to try and remove the restriction that the 
entries of M are positive for (13); see also Remark 2.10. Finally, in the proof 
presented for Theorem 3.3, for fixed M and v, the bounds on e and k are 
becoming weaker as br(T)|A2(M)p approaches 1 (i.e., e — > 1 and /c— >oo). 
It is natural to ask if for given M, z/ and K, there exist e and k for which 
the result holds uniformly for all infinite trees T with br(T)|A2(M)p < 1. 

2. Proof. Recall that we denote by M the transition matrix. In this 
section we will often multiply M from the right by a vector of functions, 
from the left by a vector of measures — in which case the resulting vector 
would also be a vector of functions/vector of measures. 

Let 1 = (1, . . . , 1)*, then clearly Ml = 1. Let Vi be the stationary prob- 
ability of state i, and v = {vi, . . . ,Vq) the stationary distribution, so that 
vM = V. In the remainder of this section we will use a,b,c,... for column 
vectors, and UjVjW, . . . for row vectors, v will always denote the stationary 
distribution. 

Note that if 5 is a column vector such that vb = 0, then vMb = vb = 0. In 
other words, the linear space v = {b (zW^ :vb = 0} is invariant under M. 

Lemma 2.1. Let b = (bi,. . . ,bqY be a vector such that bi> for all i. 
Then 

1 "2 



Proof. By Jensen, 



1 1 ^V- 1 

{Mb), Ejmjb,-^ '''b,- 



Hence, 



as needed. D 

By looking at the Jordan form of M it is easy to see the following: 
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Lemma 2.2. Given e > 0, there exists an Euclidean norm \\ ■ \\ on v 
such that \\Mb\\ < (|A2(M)| + e)||6|| for all be v^. 

Let Q be the projection onto v^ defined by Qb = b — {vb)l [note that 
vQb = vb- {vb){vl) = for all b]. 

Definition 2.3. Let || -H be a Euclidean norm on w-*-. Let u = {ui, . . . ,fg) 
be a vector of distributions on a common space. Let f = (fi,- ■ ■ , fg) be the 
vector of density functions with respect to a cr- finite measure fi, such that 
z/j <C // for every i. In other words, dvi = fidn for all i. We then define the 
discrepancy of the vector by 

q 



^f.||(/)=/llQ/fEy^/^- 



We also write D{i/) and D{f) for the discrepancy, without explicitly indi- 
cating the norm and reference measure. 

Note the similarity between the discrepancy and the x^'distance. The 
X^-distance is known to be well behaved with respect to £2 norms. Note that 
if /i = • • • = /g, then Qf = fQl = 0. Thus, Q projects into the orthogonal 
complement of the space where the discrepancy should be 0. 

Lemma 2.4. D{i') is independent of the reference measure /i; that is, 
if f = ifii ■■■ifq) and g = {gi, . . . ,gq) are such that dui = fidfi = gi dfi for 
alii, then D^^y^^if) = D~^y^^{g). 

Proof. Assume that jl<^ ^. The general case then follows by consider- 
ing the three reference measures /x, fJ- + jl, ft. 

Since Q is linear and || • || is Euclidean, we may write HQ^lP = Yl'i j=i ^ij^ibj 
for some tij G M. Now 

^,nff^ V" , f fifj , v^ , f 9i9j{dfi/dl-i)^ . 

^ (^)= i:-rU,j—d,= EvrU,J^-^^j^d, 



TTL, J ffr 



r,i,j=l 



9r D 



Lemma 2.5. Let the norm \\ • || on v-^ satisfy ||M6|| < a\\b\\ for all 6 G f "*" 
and some constant a. Then D{M f) < a^D{f) for all f , as in Definition 2.3. 

Proof. For all b, 

MQb = M{b - {vb)l) = Mb- {vb)Ml 

= Mb- {vb)l = Mb- {vMb)l = QMb. 
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Therefore, we have pointwise that 



(14) 
Now 



\\QMff = \\MQff<a^Qf\\^ 



D{Mf) = J\\QMffJ2Tj^^df, 



(15) 



(16) 



<a' WQffT. 



t (Mfl 



■dfi 






where (15) follows from (14), and (16) follows by Lemma 2.1. D 

Lemma 2.6. For every Euclidean norm || • || on v , there exists a con- 
stant C = C{\\ ■ II), such that 



(17) 



Jijj 
fk 



dfi — 1 



<CD{f), 



for all i,j,k, where f and n are as in Definition 2.3. 



Proof. By Cauchy-Schwarz, 



JiJj 
fk 



dfi — I 



ifi - fk){fj - fk) 

fk 



dfi 



<W / \Il—I^dfi. 



I fj - fk 



■dfi. 



fk ^ J fk 

Therefore, in order to prove the lemma, it suffices to prove that there exists 
a constant C such that for all i,j,k, it holds that 

(18) f\AzJlLd^i<CDU). 

J Jk 

Note that 

{Qb)i - {Qb)j = {h- (i;6)l) . - (6 - {vh)l). = hi- b,. 
Therefore, for all b, it holds that, for some constant Cij, 
\bi-bj\ = \{Qb)i-{Qb)j\<Qj\\Qb\\. 
Hence, 



Jk -^ J Jk Vk 



< -^D{f). 
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Now (18) follows by taking C = supjj- /, -^. D 

Given a u-finite measure /i on a space X, we denote by //®^ the product 
measure on X^ with marginals /i. Similarly, if /j is a density of fj with 
respect to /i, write /® for the density of vf with respect to /i®^. Finally, 
for / = (/i, . . . , /,), write /^^ = (/f ^, . . . , /®^), and for u = {u^, . . . , i.,), 

write V® = {uf ,... , yf)- We similarly use {S)r=i ^'" ^^"^ the component- 
wise product of several vectors u^ of measures. 

Lemma 2.7. Let || • || he an Euclidean norm on v^ , B > 1 an integer 
and e > 0. Then there exists a 6 >0 such that if u^ = (i/|, . . . , u^), . . . , v = 
(i/f , ...,v^), satisfy D{v') <Sforl<i<B, then 

D(<^uA<{l + e){D{u') + --- + D{u^)). 

In particular, given e > 0, there exists a 5 > such that if v = {vi, . . . , Vq) 
satisfies D{v) < 5, then D{u®^) < (1 + e)BD{v). 

Proof. The second part of the lemma immediately follows from the 
first part. Choose a reference measure /i with vl <^ fi for every i and r, and 
let f[ be the density du^ /dfi. 

As in Lemma 2.4, we may write HQftp — J2ij'^i,j^ibj- Moreover, since 
||Q1||2 = ||o||2 = 0, it follows that Eij *i,i = 0. Hence, 

Dif)=[\\Qfftfdt, 
•' k=i J^ 

(19) =Y.''kti,j m^dji 

i,j,k •> jk 

Substituting (^,.=1 f^ ™ (19)) ^^ obtain, using the reference measure n®^ , 

d' 

(20) 
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Let C be chosen to satisfy (17) in Lemma 2.6, and C = CJ2i j l^ijl- Let 5 
be chosen such that for all (xi, . . . , xb) £ [1 — C5, 1 + C(5]^, it holds that 



(21) 



< 



^ L^r=l \^r 1| 



c 



Y[xr- l-^{Xr- 1) 
r=l r=l 

By Lemma 2.6, it follows that if D{f) < 5, then for all i,j, k, 



fk 



■dfi — 1 



< CD{f) < C6. 



Therefore, it follows from (20) and (21) that 

\r=l / i,j,k,r ^•' Jk / O j ,^.;,,^ 



flf^ 



fl 



^d^i-l 



(22) 



r ^ i,j,k,r 



•'i •' J J 1 

-7f^d/i-l 

Jk 



<j2Din + l E ^fciiMic^(r) 



c 



i,j,k,r 



= il + e)J2Din, 

r 

where inequality (22) follows from Lemma 2.6. □ 

Lemma 2.8. Given a Euclidean norm || • || on v^, there exists a constant 
C(|| • II) < oo such that for any vector v = {i'i)'-^i = {fi dfi)1^^ of distributions 
we have 

(23) sup d-TY {ui,Uj)= sup / \fi- fj\dn< C\/D{f), 

where f = (/i,...,/g). 

Proof. By Cauchy-Schwarz, 

\fi- fj\dfj.< ' 



^ dfi\l I fidfi 



Ji Jj r 

Ji 



and (23) follows from Lemma 2.6. D 

Lemma 2.9. (\) Let || • || be a Euclidean norm on v^ . Let n be a proba- 
bility distribution on 1, . . . , g such that fi{i) > for all i. Then for all 5 > 0, 
there exists an e* = £*{S) < 1 such that for any vector u' = {v'l, . . . ,u'q) of 
probability distributions on 1, . . . ,q and for all e > £* , if 

^ = (1 - £){^'i, ■■■, ^'q) + e(/^, • • • , At), 
then Div) < 5. 
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(ii) Let II "11 be a Euclidean norm on v . Let vf denote the £th row of 
M^ , that is, the distribution of the chain given by Af , after r steps starting 
at i, and v^"^' = {^i , • • • , Vq ). Then for all 5 > 0, there exists an r* = r*{6) 
such that if r>r* , then D{v^'^>) < 6. 

(iii) Let || • || be an Euclidean norm on v^ . Let u = {ui,. . . , Vq) be a vector 
of probability measures such that for all 1 < z,j < q, it holds that I'iij) > 0. 
Let u^ = {i^f, . . . , uV) be a collection of probability measures on {1, . . . , g + 1} 
such that v\ = (1 — e)fj + ei^', where v' is the delta measure on g + 1. Then for 
all 5 > 0, there exists an e* = €*{5) < 1 such that if e> e* , then D{u'') < 6. 

Proof. For the first part of tlie lemma we use the representation of 
D{i') as in (19) with respect to the measure /i. Let m = min{//(l), . . . , fj,{q)}, 
and observe that if di^i = fid^, then 

, dui ^du'i 

fi = -^=e + [l-e 



so 



and for all i,j,k, 



Hence, by (19), 



dfi dfj, 

e<fi<e + {l-e)/m, 

f^fj ^ {e+{l-E)/mf 
fk ~ e 



(24) i,M<f(i±ll^£)M_i)2|,,|, 



and the right-hand side of (24) converges to as e — > 1 . 

The second part of the lemma follows from the first one, as the ergodicity 
of M implies that for all i, u^ converges to the stationary distribution of 
the chain as m ^ oo. 

The third part of the lemma is proven similarly to the first part. Let 
m = minjjg^ i/i(j) and 

/ (l-e)>^ 

^ 1=1 

Note that if di'f = fidfi^, then m < fi{i) < q ioi I < i < q and all i, and 
fi{q + 1) = 1 for all i. It follows that for all i,j, k and 1 < £ <q, 

rn^^mMl<'i_ and :Mi±M(i±l) = i 
q - fk{i) -m fk{q + l) 



RECONSTRUCTION ON TREES 13 



Moreover, fi'^{q + 1) = e and /U^({1, • • • , q}) = 1 — e, so for all i,j and k, 
^'^'df,-l= f(Ml-i)dfi<(^-l){l-e). 



fk J \ fk J \m 

Hence, by (19), 

(25) D(^^) = ^.,t,//M^^_i]<(^^_lVl_,)^|t^_^.| 

i,j,k \-' Jk / \ / ,-j 

and the right-hand side of (25) converges to as e — > 1. D 

Proof of Theorem 1.2. The basic idea of the proof is that if ij.[N]"' is 
the vector of probability measures {iJ,[N]i, . . . , /^[-/V]^ ) defined in (5), then we 
may write fi[N]^~^^ in terms of /i[A^]" using the operator M and tensoring. 
This will allow us to bound discrepancies recursively. Let pi,. . . ,pB be the 
B children of p in the i?-ary tree. Write En+i{s) for the edges in En+i that 
are on the subtree rooted in ps (formally, these are the edges of -En+i that 
are connected to p only by paths going via ps). Define L„_|_i(s) similarly. 
Finally, for a configuration a of the vertices at the first n level of the tree, 
let (Ti , . . . , ctb denote the configurations restricted to the subtrees rooted at 
pi,...,PB- Then by (4) and (5), 

1^ (x,y)e_B„+i yeL„+i 



(26) 



B / q 

= n II ^^.^' J2 ^Ws{ps)=i'} n M^six),cTs{y) 

s=l\i'=l <Js {x,y)eE„+i{s) 

^ n ^cTs{y),rs{y)] 
yeL„+i{s) / 

s=l\i'=l I 

Note that the expression in the parenthesis in (26) is given by (M(/i[A^]"))£(r| 
L„+i(s)), the £th coordinate of the vector M(/i[A^]"). It is now easy to see 
that 

(27) pXNX"^^ = {M{p[Nf)f^. 

We use (27) in order to bound discrepancies recursively. 

The assumption B\X2{M)\'^ < 1 implies by Lemma 2.2 that there exists 
an e > and a norm || • || on t;-*- such that for all 6 € t;"*", it holds that 
||M6|| < a||6||, where B{l-\-£)a'^ < 1 — £• Let 5 be chosen as to satisfy Lemma 
2.7, so that if D{f) < 5, then Z?(/®^) < B{1 + e)D{f). 
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By Lemma 2.9 it follows that there exists a k* such that ^u*^ = {fii[k], . . . ,iJ.^[k]) 
satisfies £»(/) < <5 for ah k > k*. Write /i" for {n1[k], . . . ,/i^[fc]). Thus, (27) 
implies that /u""*"^ = {Mfi"-)^^ . It now follows by Lemmas 2.7 and 2.5 that 

L»(/i'^+i) < B{1 + e)a2z)(^i") < (1 _ e)D{fi''). 
Hence, lim„^oo D{^^) = 0. We therefore conclude from Lemma 2.8 that 

lim max(iTv(Mr[^];/"?W) =0> 

and (11) follows. 

In order to prove (12), let z^ be a nondegenerate measure and note that by 
Lemma 2.9 it follows that there exist an e* < 1 such that ^^ = (/^^[z^, e], . . . , 
/ig[i/, e]) satisfies D{ijP) < 6 for ah e > e*. Now (12) follows similarly to (11). 

The proof of (13) is similar. We look at fi^ = {fJ.\{e\, . . . ,//q[e]). Note that 
(27) imphes that //^ = {M^i^)®^. Let u = Mfi^. Note that the vector u sat- 
isfies for all i that ^{{q + 1) = e and fj(j) = (1 — ^)Mij otherwise. 

Since all the entries of M are positive, it follows from Lemma 2.9 that for 
every 5' > there exists an e* < 1 such that if e > e*, then D{v) < 5' . We 
may now apply Lemma 2.7 and choose 6' > in such a way that 

The rest of the proof is identical. D 

Remark 2.10. It is an interesting goal to extend (13) to general ergodic 
chains (where some of the entries of M may be zero). Above we proved this 
for the case where all the entries of M are positive. 

The proof of Lemma 2.9 can easily be extended to the case when there 
exists an n such that the measures ^",...,/ig all have the same support 
[in such a case one can prove that there exists a value of e < 1 such that 
D{fi"') < 5 by showing that for e sufficiently large, the measures //f have most 
of their mass on the atom (g + 1, . . . , g + 1) and bounded relative densities 
elsewhere]. However, we do not know any simple characterizations of the 
matrices M for which this holds (it evidently depends only on the set of 
zero entries of M); nor we do believe that this property is necessary for 
nonerasure-robust-solvability. 

3. General trees. Our results readily extend to general infinite bounded 
degree trees, where B is replaced by br(r), the branching number of the tree. 
In [6], Furstenberg introduced the notion of the Hausdorff dimension of a 
tree. Later, Lyons [13, 14] showed how many of the probabilistic properties 
of the tree are determined by this number which he named the branching 
number. 
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For our purposes it is best to define the branching number via cutsets. A 
cutset S for a tree T rooted at p is a finite set of vertices separating p from 
oo. In other words, a finite set 5" is a cutset, if every infinite self avoiding 
path from p intersects S. An antichain is a cutset that does not have any 
proper subset which is also a cutset. 

We follow the notation of [25] and for a cutset S, write Ins(S') for the 
inside of S (the finite component of T \ 5, containing the root p), Ins£;(5) 
for edges inside S [those edges {x,y) having x G Ins(5')] and Out(S') for the 
outside of S [Out{S) =T\{SU Ins(5))] . 

Definition 3.1. The branching number of T is defined as 
br(r)=infiA>0: inf Va-I^I=oI. 



Note that 



br(r) = inf <! A > : .inf ^X!^"''''=0 

"xes J 



antichains S — ' 



By Min-Cut-Max-Flow, br(r) is also the supremum of the real numbers 
A > 0, such that T admits a positive flow from the root to infinity, where 
on every edge e of T, the flow is bounded by A~I^L It is shown in [14] that 
br(T)~^ is the critical probability for Bernoulli percolation on T. See [14] 
and [3] for equivalent definitions of br(T) in terms of percolation, cutset 
sums and electrical conductance. We note that br(TB) = B for the i?-ary 
tree Tb- 

As in Section 1, the Markov chain on T is described by an |^| x |^| 
stochastic matrix M and the perturbations by an |^| x \B\ stochastic matrix 
N. For B-aiy trees, we observed the process on the special antichains L„; 
for general trees, it seems more natural to consider arbitrary antichains. The 
distribution p[N]^ of the observed (perturbed) states on an antichain S in 
T is given by, extending (4) and (5), 

(28) fi[N]l{T) = J2 Uaip)=e} n M-i^Mv) X n ^-iy),r{y) • 

^ {x,y)&nsE(S) yes 

We proceed by defining /xf , the measure Pi[k] for /c > 0, the measure /uf [z^, e] 
for e > and nondegenerate distribution f on A and fi^ [e] . This is done in 
exactly the same way as in the case of the B-ary tree, by choosing appro- 
priate A^'s in (28). 

We say that the reconstruction problem is solvable if there exists i,j G A, 
for which 



inf . Dv{fit,fi])>0, 

b antichain ■' 
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where /if denotes the conditional distribution on as given that o"(p) = i. 
We similarly define the notions of robust-solvable, z^-robust-solvable and 
erasure-robust-solvable. 

Remark 3.2. The definitions of solvability for general trees and B-ary 
tree are not compatible. If T is the i?-ary tree, then solvability by Def- 
inition 1.1 involves only cutsets S = L^ and is therefore a weaker condi- 
tion than solvability defined here, which involves all antichains (same for 
robust-solvable etc.). However, we will obtain the same threshold for robust- 
reconstruction under both definitions. 

The proof of our main result extends to show the next theorem. 

Theorem 3.3. Consider an ergodic Markov chain on a rooted tree T 
such that br(T)|A2(M)p < 1. Then we have the following: 

(i) The reconstruction problem is not robust-solvable. 
(ii) For all nondegenerate v, the reconstruction problem is not v-robust- 
solvable. 

(iii) If all the entries of M are nonzero, then the reconstruction problem 
is not erasure-robust-solvable. 

This proves that the threshold for robust reconstruction is given by br(T) x 
|A2(M)p = 1 as the proof of Theorem 1.4 in [22] immediately generalizes to 
show the following proposition: 

Proposition 3.4. Consider an ergodic Markov chain on a tree T such 
that br(T)|A2(M)p > 1. Then the reconstruction problem is robust- solvable, 
for all nondegenerate v the reconstruction problem is v -robust- solvable and 
the reconstruction problem is erasure-robust-solvable. 

We now turn to the proof of Theorem 3.3 which generalizes the proof 
of Theorem 1.2. For a vertex x of the rooted tree T, we write T{x) for 
the subtree rooted at x, that is, the subtree consisting of x and all of its 
descendents. We will use the following lemma from Pemantle and Steif [25]. 

Lemma 3.5 ([25], Lemma 3.3). Assume that br(T) < g. Then for all 
e > there exists an anitchain S such that 

^^ /I \ l^'l 
(29) Ei^j ^^' 



Aa 



and for all y G SU Ins(5), 

(30) s 1 <1. 

x&SnT{v) ^^^ 
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Proof of Theorem 3.3. We will show that under the conditions of 
Theorem 3.3 the reconstruction problem is not robust-solvable. 

Let S be an antichain and y £ S L) Ins(S'). Consider the Markov chain 
on the subtree T(y), starting with state i at y, and let fJ,[N]f be the 
distribution of the observed states on T{y) n S. Thus, 

(31) fl[N]f{T)=Y,l{^^y)=,} n M„ix),ai-)X n ^-W,r(.)- 

f^ ix,z)€lnsE{S) zeT{y)nS 

xeT(y),z£T{y) 

We write fi[N]y'^ for {n[N]f^, . . . ,/i[A^]^'^). Note that fi[Nf = fi[N]P^^ for 
any antichain S. 

The proof of Theorem 1.2 easily extends to show that if y G Ins (5) and 
zi, . . . ,zb are the children of y, then 

B 

(32) /iJ''^[iV]=(g)(A'V'-'^[iV]). 

r=l 

We will prove the theorem by recursively analyzing discrepancies via (32). 
We will prove the result for robust-solvability and indicate the modifications 
needed for other cases at the end of the proof. 

Below, we will write /x^''^[fc] for the measure ^^'^IN], where A^ = M^. Note 
that if S is an antichain and v £ S, then the measure /i^' is a measure on 
a single node. We may therefore apply Lemma 2.9 and conclude that for all 
6 > 0, there exists a k* such that for k>k*, for all antichains S and y G S, 
it holds that 

(33) D{fiy^^[k])<6. 

Since br(T)|A2(M)p < 1, there exist, by Lemma 2.2, an e > and a norm 
II • II on t)-*- such that for all b G v^, it holds that ||M6|| < a||6||, where (1 -|- 
e) br(T)a^ < 1 — e. Recall that there is a uniform bound K on the number of 
children of vertices of T. Let 6 be chosen as to satisfy Lemma 2.7 for every 
B<K, so that if D{r) < 6 fov r = I, . . . ,B, and B < K, then D({8)f=i D < 

(l + e)E?=i^(r). 

Lemma 3.5 implies that there exists a sequence of antichains Sn such that 

(34) lim y [{l + e)a^p = 0, 

XGOn 

and that for all n and y £ SnU Ins(S'.„), 

(35) Yl [{l + eWP~^y\<l. 

x&Sr,nT{y) 
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We will now show by induction (on s — \y\, where s = maxx^s l^;]), that 
for all antichains S = Sn and all y G S" U Ins(5'), for k> k*, 

(36) D{^y^'[k])<6 Y. [(i + e)«']i"i-i^i. 

xesnT(y) 

The case where y £ S follows from (33). This also proves the base of the 
induction. For the induction step, it therefore suffices to consider y E Ins(S') 
such that the children of v denoted zi, . . . ,zb satisfy the induction hypoth- 
esis. By Lemma 2.5 and the induction hypothesis, for all r, 

(37) D{Mfi'^'^[k])<a'^D{fi'^''^[k])<6a^ ^ [(1 + e)^^]!^!-!"'-!. 

xesnT{zr) 

The right-hand side of (37) is bounded by 5 by (35), since a < 1. Therefore, 
we may apply Lemma 2.7 with (32) and (37) to obtain 

(38) D{^y^^[k])<(l + e)J2DiM^,^-^) 

r=l 

B 



<{l + e)6a'Yl E [(1 + ^) 



2l\x\~\Zr\ 



a 

r=lxeSnT(zr) 

= 6 Yl [{l+e)aY^~\y\, 

x&Sr\T{y) 

proving (36). 

Applying (36) for the root p and 5„, we get by (34), taking n — > oo, 

limsupD(/i?'^"[A:],...,/i^'^"[A:])<(5 lim ^ [(1 +e)Q2]l^l = 0, 

XkzOn 

which implies by Lemma 2.8 that 

max lim DT^^{p,f'^[klp,f-[k])=Q, 

as needed. 

The proof for ;/-robust-solvability is exactly the same. The proof for 
erasure-robust solvability requires the following modification. 

First, as in the proof of Theorem 1.2, we may find e* < 1 such that if e > e* 
and B < K, then for all y, which have B children zi,. . . ,zb in a cutset S, 
it holds that D(//^''^[e]) < 5 (with 5 as above). 

If S is an antichain, let S' denote the set of children of S. Note that S'^ is 
an antichain for all n. We prove by induction that for all antichains S = Sn 
and all y G S" U Ins(5), for e > e*, 

(39) D{ny^'^'[e])<S Y. [(l + e)"^]'"^'"'^'- 

xeSr\T{y) 
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The proof is again by induction on s — |y|, where s = niax2;g5 |a;|. The only 
difference is in that for y G 5, we use the estimate D{^y' [e]) < 5. The 
remainder of the proof is the same. D 
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